We discuss the influence of fixed target Drell-Yan data on the extraction of parton distribution
functions at next-to-next-to-leading order (NNLO) in QCD. When used in a parton distribution fit,
the Drell-Yan (DY) data constrain sea quark distributions at large values of Bjorken x. We find
that not all available DY data are useful for improving the precision of parton distribution functions
(PDFs) obtained from a fit to the deep inelastic scattering (DIS) data. In particular, some
inconsistencies between DIS-based parton distribution functions and DY data for large values of dilepton
rapidity are found. However, by selecting a sample of the DY data that is both representative and
consistent with the DIS data, we are able to perform a combined PDF fit that significantly improves
the precision of non-strange quark distributions at large values of x. The NNLO QCD corrections
to the DY process are crucial for improving the precision. They reduce the uncertainty of the
theoretical prediction, making it comparable to the experimental uncertainty in DY cross-sections over
a broad range of x.

I. INTRODUCTION

Parton distribution functions (PDFs) are important for the theoretical description of hard QCD processes at hadron
colliders. Due to the factorization of short- and long-distance physics, these functions are universal, and once extracted
from one process they can be applied to other hard QCD processes. With the Tevatron Run II under way and the 
LHC upcoming, the need for reliable PDFs is increasing. Particular aspects that warrant careful investigation are 
PDF uncertainties and their influence on theoretical predictions, and the consistent inclusion of higher order QCD 
corrections into PDF fits. 

The current standard for perturbative calculations in QCD is next-to-leading order (NLO). The typical accuracy 
of this approximation is 10-15 percent. While this level of precision is adequate for many physics processes that are
studied at the Tevatron and will be studied at the LHC, there are processes for which higher accuracy is required. This 
may happen for either calibration processes or important discovery channels, such as the production of electroweak 
gauge bosons, the Higgs boson, heavy quarks, and two jets with large transverse momenta. For such processes, it is 
desirable to have a theoretical description valid through next-to-next-to-leading order (NNLO) in perturbative QCD.
A significant effort is currently under way to develop theoretical tools for computing parton scattering cross-sections 
with NNLO accuracy. To use those calculations for predicting actual hadronic cross-sections, parton distribution 
functions with NNLO accuracy are required as well. 

There are currently two distinct approaches to extracting PDFs from existing data. The first one is the global fit 
that is practiced by the MRST and CTEQ collaborations. The data set in this case includes deep inelastic 
scattering (DIS), Drell-Yan (DY) pair production in fixed target and collider experiments, and Tevatron jet cross- 
sections. While such an approach benefits from the wealth of data, its drawback is that inconsistent data may influence 
the quality of the fit. In addition, going beyond the next-to-leading order within this framework is difficult since very
few partonic processes are currently known through NNLO in perturbative QCD. 

A different approach to extracting PDFs was suggested in Ref. [3]. The data set in this case is restricted to deep inelastic
scattering. Higher order QCD corrections can be included consistently within this approach since the QCD corrections
to DIS coefficient functions and DGLAP splitting functions are known through NNLO. The disadvantage of
the DIS-based approach is that the DIS data are only sensitive to certain combinations of PDFs. Consequently, not
every parton distribution function can be reliably constrained. This leads to large, approximately 20%, errors on sea
quark and gluon distributions at relatively large values of the Bjorken variable x, x ≳ 0.1.

The determination of sea quark distribution functions can be improved if the approach of Ref. [3] is extended to
include precise data on fixed target Drell-Yan processes. These data cover the important kinematic range
Q^2 ~ (20 GeV)^2 and x ≳ 0.1, and are strongly sensitive to sea quark distributions in the proton. While such an
extension seems obvious, Drell-Yan data were not incorporated into the NNLO fit of Ref. [3] because until recently
only the NLO calculation of the dilepton rapidity distribution in the Drell-Yan process was available. Recent
NNLO QCD computations of the rapidity distribution remove this obstacle and permit consistent inclusion
of the Drell-Yan data in the PDF fit. The purpose of the present paper is to perform a combined analysis of the DIS 
and DY data and to elucidate the impact of the DY data on parton distribution functions. 

This paper is organized as follows. In the next Section we investigate the consistency of fixed target DY data
and theoretical predictions obtained with the DIS PDFs of Ref. [3]. This consistency is the necessary condition for
combining the DIS and DY data; if it is not fulfilled, the errors on parton distribution functions obtained in a 
combined fit are meaningless. We show that available DY data are precise enough so that it is beneficial to include 
these data in a combined DIS/DY fit. Having established the consistency of the DIS PDFs with the DY data, we 
incorporate those data in a combined DIS/DY fit which is described in Section III. Inclusion of the DY data into 
the fit improves the precision of sea quark distribution functions for large values of x. The quality of the DIS/DY 
fit is similar to the quality of the DIS fit of Ref. [3]. We discuss implications of the combined fit for basic QCD and
electroweak observables such as the value of the strong coupling constant α_s(M_Z), the Paschos-Wolfenstein ratio and
the production cross-sections of Z and W bosons at the Tevatron. Finally, we present our conclusions. 

II. DIS PARTON DISTRIBUTION FUNCTIONS AND THE DY DATA 

As we discussed in the Introduction, before incorporating fixed target DY data into the PDF fit based on the DIS 
data, we need to check if those data sets are consistent. To do so, we compute the dilepton rapidity distribution 
for fixed target DY processes using the DIS PDFs [3] and compare the results of the calculation to experimental
data. We assume that dimuon production in the Drell-Yan process is well described by the leading
twist factorization and that nuclear corrections are unimportant. There are then two sources of uncertainties in the
theoretical prediction. First, there is residual dependence on the factorization and renormalization scales, a feature
common to all fixed order calculations in perturbative QCD. Second, parton distribution functions obtained from a fit
to data have systematic uncertainties that influence the theoretical prediction of the dimuon rapidity distribution. For
the fixed target DY processes that are considered in this paper, PDF uncertainties are larger than the residual scale
uncertainty of the NNLO calculation. We are therefore mostly concerned with PDF uncertainties in what follows.

We choose three sets of fixed target DY data for our analysis. These experiments use an 800 GeV proton
beam but employ different targets such as hydrogen (E-866), copper (E-605) and deuterium (E-772, E-866). The
center-of-mass energy of the DY process for these three experiments is √s = 38.8 GeV. These experiments therefore
cover a broad range of dilepton invariant mass M and Bjorken x: M < 20 GeV and x > 0.01. Note that distributions
in the Feynman variable x_F rather than the dilepton rapidity are measured by E-772 and E-866; however, the only
distribution known through NNLO in perturbative QCD is the dilepton rapidity distribution. We relate the x_F
distribution and the rapidity distribution using leading order kinematics. This procedure is justified, since for all DY
experiments relevant for our analysis, the average value of the dilepton transverse momentum p_⊥ ~ 1 GeV is small
compared to the dilepton invariant mass M > 5 GeV. We have checked that the use of leading order kinematics does
not introduce significant bias in the final results. 
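
The leading order relation between x_F and the rapidity can be illustrated with a short numerical sketch. It assumes
that the dilepton transverse momentum is neglected, so that x_1 = (M/√s) e^Y, x_2 = (M/√s) e^{-Y} and x_F = x_1 - x_2.
The function names are illustrative and are not taken from the analysis code.

```python
import math

SQRT_S = 38.8  # GeV, fixed-target centre-of-mass energy quoted in the text


def momentum_fractions(mass, rapidity, sqrt_s=SQRT_S):
    """Leading-order parton momentum fractions (x1, x2) of a dilepton pair."""
    sqrt_tau = mass / sqrt_s
    return sqrt_tau * math.exp(rapidity), sqrt_tau * math.exp(-rapidity)


def feynman_x(mass, rapidity, sqrt_s=SQRT_S):
    """x_F = x1 - x2, valid when the dilepton transverse momentum is neglected."""
    x1, x2 = momentum_fractions(mass, rapidity, sqrt_s)
    return x1 - x2


def rapidity_from_xf(mass, x_f, sqrt_s=SQRT_S):
    """Invert x_F = 2 (M / sqrt(s)) sinh(Y) to obtain the rapidity Y."""
    return math.asinh(x_f * sqrt_s / (2.0 * mass))
```

For example, `rapidity_from_xf(8.45, feynman_x(8.45, 0.6))` returns the input rapidity 0.6 up to rounding, and
`feynman_x(M, 0.0)` vanishes because x_1 = x_2 at zero rapidity.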

The sensitivity of parton distribution functions to the DY data can be understood from the analytic expression for 
the DY process at leading order in perturbative QCD. The double differential distribution in dilepton invariant mass 
M and rapidity Y can be written as 

    \frac{d^2\sigma}{dM\,dY} \sim q(x_1)\,\bar{q}(x_2) + \bar{q}(x_1)\,q(x_2),    (1)

where x_1 = (M/√s) e^Y and x_2 = (M/√s) e^{-Y}. Eq. (1) implies that, at leading order, the rapidity distribution is
determined by either annihilation of a valence quark from the projectile and a sea antiquark from the target or vice
versa. Valence and sea quark distribution functions are determined from the DIS data with differing precision. The
precision of valence quark distributions is a few percent for all values of x relevant for the DY and DIS data that we
consider in this paper. Sea quark distributions are known from the DIS data with a few percent precision only for
x ≲ 0.1. For larger values of x the error increases rapidly and exceeds 20% [3]. Since the theoretical predictions for
d^2σ/dYdM are more precise than this error, sea quark distributions can be determined from Eq. (1) with an
accuracy comparable to the precision of the available DY fixed target data.

FIG. 1: The NLO (dashes) and NNLO (solid) dilepton rapidity distributions for proton-copper collisions, calculated with the
DIS PDFs of Ref. [3], in comparison with the E-605 data at zero rapidity. The NNLO 1σ uncertainty band due to PDF errors
is displayed by the dotted curves. The relation between x_1 and x_2 for data points in the upper panel is shown in the lower
panel.



We begin by comparing theoretical predictions for the dilepton double differential distribution in invariant mass 
and rapidity with the E-605 proton-copper scattering data. The comparison is shown in Fig. 1 for the rapidity Y = 0;
note that different values of the dilepton invariant mass M contribute to this plot. In the lower panel of Fig. 1, values
of x_1 and x_2 are plotted assuming leading order kinematics. Theoretical curves are computed with the NNLO DIS
PDFs [3]; we choose equal values for the factorization and renormalization scales and set them equal to the invariant
mass of the dilepton pair. The theoretical band reflects the 1σ uncertainty of the DIS PDFs. It is apparent from
Fig. 1 that for x_{1,2} ≳ 0.2, the data are more precise than the theoretical prediction and the data points are within the
theoretical uncertainty band. The theoretical prediction shown in Fig. 1 does not include the uncertainty associated
with the variation of the renormalization and factorization scales. This uncertainty is about ten percent and is much
smaller than the 30% PDF error. It is clear from Fig. 1 that the E-605 data are consistent with the DIS data, and
may therefore be used in the PDF fit with the DIS data. The precision of the PDFs obtained from a combined fit
must improve compared to the situation when only the DIS data are fitted. We note that although Fig. 1 refers to a
particular rapidity value, the E-605 data and the theoretical prediction based on the DIS PDFs are in agreement for 
other values of dilepton rapidity as well. 




FIG. 2: The same as in Fig. 1 for the E-866 proton (upper panels) and deuteron (middle panels) data for dilepton masses
M = 5.45 GeV (left panels) and M = 8.45 GeV (right panels). 

A similar analysis can be performed for the E-866 hydrogen and deuterium data; note that the E-866 data cover
a broader kinematic range than the E-605 data. In this case, we arrive at two different conclusions depending on the
invariant mass of the dilepton pair produced in the DY process. We find that for large dilepton invariant masses there
is a reasonable agreement between predictions based on the DIS PDFs and the experimental data; this kinematic
region is the same as covered by the E-605 data. However, for small invariant masses and large rapidities the E-866
data are in systematic disagreement with theoretical predictions based on the DIS PDFs. The corresponding results
are shown in Fig. 2.

We now discuss the region of small invariant masses in detail. From Fig. 2 we observe that the experimental data
are lower than the theoretical prediction. The disagreement occurs in the region x_1 ≫ x_2 with x_2 ≲ 0.1. For such
values of x_{1,2}, q_val(x_1) ~ q_val(x_2) and q_sea(x_1) ≪ q_sea(x_2). The second term in Eq. (1) is therefore negligible and the
production cross-section is mainly determined by the sea quark distribution q̄(x_2) with x_2 ≲ 0.1. However, for such
values of x_2 the precision of sea quark distribution functions obtained from the DIS data is close to a few percent
[3]. We therefore conclude that for this kinematic range, the available DY data cannot improve the precision of the
DIS PDFs. Instead, the theoretical prediction for the dilepton rapidity distribution obtained with the DIS PDFs is
a non-trivial check of the consistency of the data. It follows from Fig. 2 that this consistency check fails since the
experimental data are systematically below theoretical predictions. We note that the NLO theoretical prediction is 
in better agreement with the data. While this is clearly accidental, it may result in misleading conclusions about
the compatibility of different data sets. Forcing PDFs to fit both data sets is a bad solution; the PDFs obtained in
that case result in rapidity distribution curves that pass between the DIS-based prediction and the E-866 data, the fit
quality deteriorates and no reduction of the PDF uncertainty is achieved. We conclude that there is a contradiction
between the DIS data and the small dimuon mass data obtained by the E-866 collaboration. In the region where the
disagreement occurs, the PDFs are already known precisely from the DIS data. Hence, for such values of dilepton
invariant mass the DY data do not improve the precision of sea quark PDFs.

The disagreement between the DIS-based prediction and the E-866 data for small invariant masses occurs at large 
rapidities. This kinematic region is known to be problematic for existing fixed target DY experiments. In particular, 
there is a disagreement between the E-772 and E-866 deuterium data, with the E-772 data points being systematically 
higher. In principle, this is exactly what is needed to match the DIS-based prediction and the DY data, as can be 
seen from Fig. 2. However, as shown in Fig. 3, the E-772 data points are somewhat too high on average.

FIG. 3: The same as in Fig. 1 for the E-772 deuterium data and for the dimuon invariant mass M = 4.75 GeV.



We suspect that problems with the large rapidity region originate from underestimated systematic uncertainties. If 
this is the case, the ratio of cross-sections for hydrogen and deuterium targets measured by the E-866 collaboration 
is useful since many systematic uncertainties cancel in the ratio. We note that the theoretical prediction for the
ratio is also more precise; for example, the dependence on the factorization and renormalization scales, a ten percent 
effect in the theoretical predictions for individual cross-sections, disappears in the ratio. 

The E-866 results for the ratio of deuteron to proton cross-sections and the theoretical prediction based on the 
DIS PDFs are compared in Fig. 4. In this case, there is an agreement between theory and data for small invariant
masses, whereas for larger invariant masses and larger values of Bjorken x, the shape of the DY data differs from
the DIS prediction. However, this region is not really problematic for the consistency of the DIS and DY data since
it is strongly sensitive to sea quark PDFs for x > 0.1, where sea quark PDFs obtained from the DIS fit suffer from
large uncertainties. Given the large PDF errors in the region of x where the disagreement occurs, we conclude
that the E-866 data on the ratio of deuteron to proton cross-sections can be used together with the DIS data without 
sacrificing the quality of the fit. 

Having compared theoretical predictions based on the DIS PDFs with the data on DY processes, we briefly discuss 
changes that can be expected once the DY data is included in the fit. As an illustration, consider the E-866 data for 
the dimuon invariant mass M = 8.45 GeV, shown in Fig. 2. For larger values of x_2, we observe that for both proton
and deuteron targets the experimental data points are somewhat higher than the theory prediction. To make theory
agree with experiment, we require that sea quark distributions for x ≳ 0.1 increase. Moreover, since the disagreement
between theory and experiment is stronger for the proton data, the ū distribution function should receive a larger
increase than the d̄ distribution. This observation is consistent with the results for the ratio of deuteron to proton
cross-sections in Fig. 4. The ratio of the two cross-sections can be approximated by



(2) 



It follows that since the ratio of computed cross-sections is higher than the experimental result, the ratio d̄/ū should
decrease. This is consistent with the information from the absolute measurement of proton-proton and proton-deuteron
cross-sections. It is interesting to note that the d̄ distribution function almost coincides for DIS PDFs [3]
and MRST PDFs, whereas the ū distribution function from the DIS fit is smaller than the one obtained by MRST.
This is not accidental, since MRST includes the E-866 data in their fit. While the preceding discussion indicates
how sea quark distributions are influenced by the DY data, it is less obvious that gluon PDFs at large values of x
may also be affected. To see that this may happen, recall that the contribution of the qg partonic subprocess to
the dimuon production cross-section is relatively large, approximately 15% of the total, and negative. Decreasing
the gluon content of the proton may therefore increase the rapidity distribution. A similar effect can be achieved by
increasing sea quark distributions. Since both gluon and sea PDFs at large x are poorly constrained by the DIS fit,
the impact of the DY data on each of these distributions cannot be disentangled using qualitative considerations.

FIG. 4: The same as in Fig. 1 for the deuteron to proton cross-section ratio measured by the E-866 experiment. Larger values
of x_2 correspond to larger dimuon invariant masses.
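
The qualitative link between the cross-section ratio and the sea flavor asymmetry can be sketched numerically. The
snippet below assumes the approximation commonly quoted for x_1 ≫ x_2, namely σ(pd)/(2σ(pp)) ≈ [1 + d̄(x_2)/ū(x_2)]/2.
This form and its normalization are an assumption made for illustration and need not coincide with Eq. (2); the
function names are hypothetical.

```python
def ratio_pd_over_2pp(dbar_over_ubar):
    """Approximate sigma(pd) / (2 sigma(pp)) for x1 >> x2 (assumed standard form)."""
    return 0.5 * (1.0 + dbar_over_ubar)


def dbar_over_ubar_from_ratio(ratio):
    """Invert the approximation to read off dbar/ubar at x2 from a measured ratio."""
    return 2.0 * ratio - 1.0
```

Within this approximation the ratio is a monotonically increasing function of d̄/ū, so a prediction that overshoots
the measured ratio calls for a smaller d̄/ū, in line with the argument in the text.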

Following the discussion in this Section, we include the E-605 data and the E-866 data on the ratio of deuteron to 
proton cross-sections in the combined DIS/DY fit. These two data sets improve the precision of sea quark distributions 
obtained from the DIS fit in two different ways. The E-605 data improve the precision of sea quark distributions for
x ≳ 0.2 in a "flavor-blind" fashion, whereas ū - d̄ for x ≳ 0.1 is constrained by the E-866 data. Note that even if
the E-866 and E-772 measurements of the absolute proton and deuteron cross-sections were consistent with the DIS
data, they could not have added much new information compared to the DY data which we include in the fit. This is
because at small rapidities the E-605 data are as good as the E-866 and E-772 data, while at large rapidities the PDFs
are already constrained by the DIS data. We conclude that our selection of the DY data is sufficiently representative
and can be combined with the DIS data to determine parton distribution functions with high precision. We describe
the results of the combined fit in the next Section.

III. A FIT TO THE COMBINED DIS AND DY DATA

A. Theoretical input

In this Section we fit PDFs to both the DIS and DY data. We begin with a brief description of the salient features
of the approach in Ref. [3]. We use the following parameterization of parton distribution functions at Q_0^2 = 9 GeV^2:

    x q_V(x, Q_0) = \frac{2\delta_{qu} + \delta_{qd}}{N^V_q}\, x^{a_q} (1-x)^{b_q}\, x^{P_{q,V}(x)}, \quad P_{q,V} = \gamma_{1,q}\, x + \gamma_{2,q}\, x^2, \quad q = u, d;    (3)

    x q_S(x, Q_0) = A_q\, x^{a_{qs}} (1-x)^{b_{qs}}\, x^{P_{q,s}(x)}, \quad P_{q,s} = \gamma_{1,qs}\, x, \quad q = u, d, s;    (4)

    x G(x, Q_0) = A_G\, x^{a_G} (1-x)^{b_G}\, x^{P_G(x)}, \quad P_G = \gamma_{1,G}\, x.    (5)

Valence quark distributions are displayed in Eq. (3), sea quarks are shown in Eq. (4) and gluons are shown in Eq. (5).
To obtain PDFs at arbitrary Q^2, we employ the DGLAP evolution equation with the NNLO Altarelli-Parisi splitting
kernels computed recently. The PDF parameterization in Eqs. (3)-(5) differs from the one in Ref. [3]. It allows
more flexibility, which is important since more data are included in the fit. Note that some parameters in Eqs. (3)-(5)
are inter-dependent. For valence quarks, N^V_q is calculated from the requirement that the total numbers of valence u
and d quarks are two and one, respectively. Also, the normalization of the gluon distribution, A_G, is related to the
other parameters through the momentum conservation constraint. Since the strange quark distribution is not well
constrained by the data used in the fit, we fix it using the CCFR data on dimuon production in neutrino-nucleon
collisions [14]. This leads to A_s = 0.08, b_ss = 7, and γ_{1,ss} = 0. We also set a_us = a_ds = a_ss, which is a natural choice
since the existing DIS/DY data are not useful for detecting non-universality of sea PDFs at small x. The contribution
of heavy quarks to DIS structure functions is accounted for within the massive factorization scheme using one-loop
computations. For the fixed-target DY data employed in the fit, heavy quark contributions are
unimportant. 
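
The valence normalization fixed by the quark-number sum rule can be illustrated as follows. The sketch assumes the
shape q_V(x) ∝ x^(a-1) (1-x)^b x^(P(x)) with P(x) = γ_1 x + γ_2 x^2, which is our reading of Eq. (3), and evaluates the
integral with a simple midpoint rule; the function names are illustrative and not part of the fitting code.

```python
import math


def shape_integral(a, b, gamma1=0.0, gamma2=0.0, steps=20000):
    """Integrate x^(a-1) (1-x)^b x^P(x) over 0 < x < 1, with P = gamma1*x + gamma2*x^2.

    The substitution x = t**(1/a) absorbs the integrable endpoint singularity,
    because x^(a-1) dx = dt / a. Requires a > 0.
    """
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) / steps           # midpoint of the i-th sub-interval
        x = t ** (1.0 / a)
        poly = gamma1 * x + gamma2 * x * x
        total += (1.0 - x) ** b * x ** poly
    return total / (a * steps)


def valence_density(x, n_valence, a, b, gamma1=0.0, gamma2=0.0):
    """q_V(x) normalized so that its integral over x equals n_valence (2 for u, 1 for d)."""
    norm = shape_integral(a, b, gamma1, gamma2)
    poly = gamma1 * x + gamma2 * x * x
    return n_valence / norm * x ** (a - 1.0) * (1.0 - x) ** b * x ** poly
```

For γ_1 = γ_2 = 0 the integral reduces to the Euler Beta function B(a, b+1), which provides a simple cross-check of
the numerical integration.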
FIG. 5: Data points used in the DIS/DY fit vs. predictions based on fitted PDFs. The bands reflect the 1σ uncertainty of
fitted PDFs.



The DIS deuteron data are corrected for nuclear effects that include Fermi motion, shadowing and nucleon off-shellness.
Since the deuteron nuclear correction increases with x, the cut x < 0.75 was applied to the DIS deuteron
target data in Ref. [3]. Because uncertainties in nuclear effects at large x are now better understood, we do
not apply a similar cut in the current analysis and include the DIS data points up to x = 0.9, the largest value of
x available in the existing DIS data. The DY data are not corrected for nuclear effects since these data points are
concentrated at x ≲ 0.3, where nuclear corrections are small.
TABLE I: The number of data points (NDP) and values of χ^2/NDP for each experiment used in the fit.

Experiment      NDP    χ^2/NDP    Experiment      NDP    χ^2/NDP
SLAC-E-49A      118    0.56       BCDMS           605    1.10
SLAC-E-49B      299    1.18       NMC             490    1.26
SLAC-E-87       218    0.94       H1 (96-97)      135    1.13
SLAC-E-89A      148    1.42       ZEUS (96-97)    161    1.28
SLAC-E-89B      162    0.80       FNAL-E-605      119    1.49
SLAC-E-139       26    1.03       FNAL-E-866       39    1.13
SLAC-E-140       17    0.47       Total          2537    1.13



Our treatment of power corrections to logarithmic evolution of the DIS structure functions follows previous analyses. We
suppress the sensitivity of the structure functions to power-like terms by removing the DIS data with Q^2 < 2.5 GeV^2
and hadronic invariant mass W < 1.8 GeV. For the remaining data, target mass corrections important at large x are
applied using the Georgi-Politzer scheme. Applying just the target mass corrections is insufficient. We must also
add twist-4 terms to the DIS structure functions. These terms are parameterized by cubic spline polynomials of x
whose coefficients are fitted to data. Note that twist-4 contributions produce only ~ 10% corrections to DIS PDFs
even for Q^2 ~ 2.5 GeV^2 and become unimportant for Q^2 ~ 20 GeV^2. By analogy, since the DY data employed in our
analysis correspond to M^2 > 25 GeV^2, we do not consider power corrections to this part of the data sample.
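
The kinematic cuts on the DIS data can be sketched as a simple filter. The snippet assumes the standard relation
W^2 = M_N^2 + Q^2 (1 - x)/x for the hadronic invariant mass, and reads the cuts as keeping points with
Q^2 ≥ 2.5 GeV^2 and W ≥ 1.8 GeV; the function names and the use of the proton mass for all targets are illustrative
assumptions.

```python
import math

NUCLEON_MASS = 0.938272  # GeV, proton mass used as the nucleon mass (assumption)
Q2_MIN = 2.5             # GeV^2, cut quoted in the text
W_MIN = 1.8              # GeV, cut quoted in the text


def hadronic_mass(x, q2, nucleon_mass=NUCLEON_MASS):
    """Invariant mass W of the hadronic final state in DIS."""
    return math.sqrt(nucleon_mass ** 2 + q2 * (1.0 - x) / x)


def passes_cuts(x, q2):
    """True if a DIS data point at (x, Q^2) survives both cuts."""
    return q2 >= Q2_MIN and hadronic_mass(x, q2) >= W_MIN
```

The W cut mainly removes points at large x: for instance, a point at x = 0.9 and Q^2 = 3 GeV^2 has W of about
1.1 GeV and is discarded even though it passes the Q^2 cut.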

B. Results of the fit 

The PDF parameters in Eqs. (3)-(5) and the coefficients of the twist-4 corrections to the DIS structure functions are
obtained from the fit to the DIS data for proton and deuteron targets [20] and the DY data selected in Section II. To check
that our PDF parameterization is sufficiently flexible, we modified the polynomials P(x) in Eqs. (3)-(5) by adding
terms of the type γ_n x^n, n = 2, 3 for the sea, gluon and valence distributions. We found that such modifications
do not improve the description of the data. The overall quality of the fit is good; for its final version the value
χ^2/NDP = 2862/2537 = 1.13 is obtained.
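
As a consistency check, the total χ^2 can be approximately recovered from the per-experiment entries of Table I.
The sketch below uses the rounded table values, so the sum is expected to match the quoted total only within
rounding; the dictionary and variable names are illustrative.

```python
# (NDP, chi2/NDP) per experiment, as listed in Table I
TABLE_I = {
    "SLAC-E-49A": (118, 0.56),
    "SLAC-E-49B": (299, 1.18),
    "SLAC-E-87": (218, 0.94),
    "SLAC-E-89A": (148, 1.42),
    "SLAC-E-89B": (162, 0.80),
    "SLAC-E-139": (26, 1.03),
    "SLAC-E-140": (17, 0.47),   # NDP chosen so that the entries sum to the quoted total of 2537
    "BCDMS": (605, 1.10),
    "NMC": (490, 1.26),
    "H1 (96-97)": (135, 1.13),
    "ZEUS (96-97)": (161, 1.28),
    "FNAL-E-605": (119, 1.49),
    "FNAL-E-866": (39, 1.13),
}

total_ndp = sum(ndp for ndp, _ in TABLE_I.values())
total_chi2 = sum(ndp * chi2_per_ndp for ndp, chi2_per_ndp in TABLE_I.values())
chi2_per_ndp_total = total_chi2 / total_ndp
```

The summed χ^2 comes out close to 2861, which agrees with the quoted 2862 within the rounding of the individual
entries.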







FIG. 6: The 1σ bands for isospin-symmetric and anti-symmetric sea quark distributions from the DIS/DY (solid) and DIS
(dashes) fits.

To demonstrate the quality of the fit in more detail, we show values of χ^2/NDP for separate experiments in Table I.
It is clear from the Table that the description of the data is acceptable. In Fig. 5, results for the pulls of the DY data
used in the fit are displayed. They do not demonstrate any systematic trend. The description of the E-605 data has
randomly distributed deviations that can be attributed to fluctuations beyond quoted experimental errors. We can
model the possibility of some experimental errors being underestimated by re-scaling the errors for experiments with
χ^2/NDP > 1. We find that these scale factors do not exceed 1.2 and the impact of the re-scaling on the PDF errors
is within 20%.

TABLE II: Parameters of parton distribution functions derived from the NNLO QCD fit to the DIS and DY data. The errors
on fit parameters are obtained by propagating the statistical and systematic errors in the data. As described in the text, a_us
and a_ds are identical by construction.

       u_V               d_V               u_S                 d_S                 G
a      0.670 ± 0.035     0.61 ± 0.12       -0.2182 ± 0.0044    -0.2182 ± 0.0044    -0.198 ± 0.015
b      3.639 ± 0.077     5.21 ± 0.42       6.14 ± 0.25         8.24 ± 0.40         5.41 ± 0.13
γ_1    -0.41 ± 0.27      0.18 ± 0.27       1.04 ± 0.32         -1.97 ± 0.48        2.09 ± 0.94
γ_2    -0.91 ± 0.18      -4.19 ± 0.18
A                                          0.1488 ± 0.0060     0.1220 ± 0.0063
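
The error re-scaling mentioned in the text can be illustrated with a minimal sketch. The prescription is not spelled
out here, so the snippet assumes the common convention of inflating the errors of an experiment by sqrt(χ^2/NDP)
whenever χ^2/NDP > 1; the function name and this choice are assumptions.

```python
import math


def error_scale_factor(chi2_per_ndp):
    """Assumed prescription: inflate errors by sqrt(chi2/NDP) only if chi2/NDP > 1."""
    return math.sqrt(chi2_per_ndp) if chi2_per_ndp > 1.0 else 1.0


# A few chi2/NDP values taken from Table I
CHI2_PER_NDP = {
    "SLAC-E-49A": 0.56,
    "SLAC-E-89A": 1.42,
    "NMC": 1.26,
    "ZEUS (96-97)": 1.28,
    "FNAL-E-605": 1.49,
}

scale_factors = {name: error_scale_factor(value) for name, value in CHI2_PER_NDP.items()}
largest = max(scale_factors.values())
```

Under this assumption the largest factor, about 1.22 for the E-605 data, is in line with the bound of roughly 1.2
quoted in the text, while experiments with χ^2/NDP below one are left unchanged.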







FIG. 7: The same as in Fig. 6 for up (down) quarks and gluons.

Having established that the combined DIS/DY fit leads to an acceptable description of the data, we discuss the
major differences between the DIS/DY and DIS PDFs. For this comparison, the DIS PDFs were re-calculated using
the parameterizations shown in Eqs. (3)-(5). Hence, the comparison presented below illustrates the differences in PDFs
caused by the inclusion of the fixed target Drell-Yan data into the fit. 

As we discussed in Section II, we expect sea quark distributions at large values of x to be mostly affected by the
DY data. This is indeed what happens, as shown in Fig. 6. Both the symmetric and anti-symmetric combinations of
ū and d̄ distributions are displayed. Dramatic improvements in the precision for large values of x are observed once
the DY data are included in the fit. For x < 0.1, the impact of the DY data on the isospin-symmetric combination
x(ū + d̄) is marginal, whereas the precision of the combination x(d̄ - ū) in the DIS/DY fit is higher for x > 0.02. The
central values of the sea quark distributions obtained in the DIS/DY and DIS fits agree within the errors, indicating 
consistency between the DIS and DY data. The largest discrepancies are at the level of one standard deviation; they 
occur at small x, where the DIS and DY data have comparable precision. 

A better separation of sea and valence quark distributions in the DIS/DY fit leads to an increased precision of quark
distributions, as shown in Fig. 7. The effect is more pronounced for the d-quark content of the proton. Both the
u- and d-quark distributions obtained in the DIS/DY fit are smaller than similar distributions in the DIS fit at moderate
values of x, but the difference is about 1σ. The gluon distribution is practically unaffected by the DY data used in
the fit, as seen in Fig. 7.

The theoretical errors of the DIS/DY PDFs due to variations of the renormalization and factorization scales do not
exceed the "experimental" errors obtained by propagating statistical and systematic uncertainties in the data. The
theoretical uncertainties have the largest impact on the isospin-symmetric combination of sea quark distributions,
where the theoretical and experimental errors are comparable for x > 0.3; this is shown in Fig. 8. The DIS/DY fit
constrains non-strange sea quark distributions with a precision better than ±30% for x ~ 0.7. The NNLO QCD
corrections to the DY process are crucial for achieving this precision. If the NLO QCD theoretical prediction for the
DY rapidity distribution is used in the fit, the theoretical uncertainty due to the renormalization scale variation is a
factor of two larger than in the NNLO fit; as shown in Fig. 8, it exceeds experimental errors in the isospin-symmetric
sea distribution at large values of x.

FIG. 8: The 1σ errors on the isospin symmetric and anti-symmetric sea quark distributions due to uncertainties in data. The
results of the current analysis (solid) are compared to those of the CTEQ collaboration (dots) and to the uncertainties due to
variations of the renormalization and factorization scales (dashes). The latter quantity with the DY cross-section calculated
through NLO in perturbative QCD is also given for comparison (dot-dashes).

The similar error estimated in the CTEQ fit is an order of magnitude larger, as shown in Fig. 8. One of the
reasons for this disagreement is that in the CTEQ analysis, the criterion Δχ^2 = 100 is applied to account for possible
inconsistencies in the data. In our case, good data consistency is a prerequisite for assembling the data sample.
Hence, we apply the standard criterion Δχ^2 = 1 that allows us to use the full power of the statistical analysis in our
PDF determination. 
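
The effect of the tolerance criterion on the quoted errors can be made explicit with a short sketch. It assumes a
purely quadratic χ^2 near the minimum, χ^2(p) = χ^2_min + ((p - p_0)/σ)^2, so that the allowed parameter interval
scales as the square root of Δχ^2; the function name is illustrative.

```python
import math


def interval_half_width(sigma, delta_chi2):
    """Half-width of the parameter interval in which chi2 rises by delta_chi2.

    Assumes chi2(p) = chi2_min + ((p - p0) / sigma)**2, so that sigma is the
    error corresponding to Delta chi2 = 1.
    """
    return sigma * math.sqrt(delta_chi2)


standard = interval_half_width(1.0, 1.0)     # criterion used in this analysis
tolerant = interval_half_width(1.0, 100.0)   # criterion quoted for the CTEQ analysis
ratio = tolerant / standard
```

In this idealized setting the choice Δχ^2 = 100 alone inflates the error by a factor of ten, which is the order of
magnitude of the difference discussed above.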



C. Phenomenological implications 

In this Section we briefly discuss some phenomenological implications of the above analysis. A broad measure of 
the consistency of PDFs with other observables is provided by the value of the strong coupling constant α_s(M_Z). The
strong coupling constant obtained in the DIS/DY fit, α_s(M_Z) = 0.1128(15), agrees with the value obtained in the
DIS fit of Ref. [3] within errors. It is interesting that PDF fits generally prefer smaller values of the strong coupling
constant than the current world average value α_s(M_Z) = 0.1176(20) [22], and that the inclusion of NNLO corrections
into the fits makes the disagreement larger.
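
The size of the difference between the fitted coupling and the world average can be quantified with a simple
estimate. The sketch assumes that the two uncertainties are uncorrelated and Gaussian, which is an idealization;
the function name is illustrative.

```python
import math


def tension_in_sigma(value_a, error_a, value_b, error_b):
    """Difference of two values in units of their combined (uncorrelated) error."""
    return abs(value_a - value_b) / math.hypot(error_a, error_b)


fit = (0.1128, 0.0015)            # alpha_s(M_Z) from the DIS/DY fit
world_average = (0.1176, 0.0020)  # world average quoted in the text
tension = tension_in_sigma(*fit, *world_average)
```

With these inputs the difference amounts to about 1.9 combined standard deviations.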

Another interesting observable to discuss is the Paschos-Wolfenstein ratio. Recently, the NuTeV collaboration
measured the Weinberg angle in neutrino-nucleon scattering and observed an anomaly. The significance of this
anomaly is still an open issue since its interpretation depends on subtle details of the quark structure of the nucleon
and on the correct application of QCD and electroweak radiative corrections. While discussing these issues is
beyond the scope of this paper, we would like to illustrate briefly the importance of improving PDFs at large values
of x for the NuTeV analysis.

For the sake of illustration, we consider the Paschos-Wolfenstein ratio

    R^- = \frac{\sigma^{\nu}_{NC} - \sigma^{\bar\nu}_{NC}}{\sigma^{\nu}_{CC} - \sigma^{\bar\nu}_{CC}} = \frac{1}{2} - \sin^2\theta_W.    (6)

Although the NuTeV collaboration does not measure this ratio directly, we assume that R^- is extracted from the data
and is used to determine the Weinberg angle. The simple relation between R^- and sin^2 θ_W in Eq. (6) is only valid for
an isoscalar target. Since the iron target used by NuTeV is not isoscalar, there is a correction to the Paschos-Wolfenstein
ratio

where A and Z are the target atomic weight and charge and

    x^-_{0,1} = \int_0^1 dx\, x\,(u_{val} \pm d_{val}).    (8)

For iron, the correction δR^- is a factor of ten larger than the NuTeV experimental error; hence the ratio x_1^-/x_0^- must be known to
better than 10%. For the DIS/DY PDFs obtained in this paper, the value of x_1^-/x_0^- at Q^2 = 20 GeV^2 is 0.4459 ± 0.0094.
The DIS/DY PDFs therefore suppress the errors in the determination of sin^2 θ_W due to the non-isoscalarity of the NuTeV
target to an acceptable value. We stress that inclusion of the DY data into the fit is crucial for achieving this accuracy.
For example, in the NNLO DIS fit of Ref. [3] the value x_1^-/x_0^- = 0.4324 ± 0.0281 at Q^2 = 20 GeV^2 was obtained. In
the global NLO fits by the CTEQ and MRST collaborations, these values are 0.4197 ± 0.0307 and 0.4317 ± 0.0204, 
respectively. 
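
The impact of these uncertainties on the NuTeV analysis can be estimated with simple arithmetic. The sketch assumes
that the non-isoscalarity correction is proportional to the ratio of valence moments and is exactly ten times the
experimental error, so that a relative error r on the ratio induces an error of 10 r in units of the experimental
error; the names and the factor-of-ten idealization are assumptions.

```python
# (central value, error) of the valence moment ratio at Q^2 = 20 GeV^2, as quoted in the text
RATIOS = {
    "DIS/DY (this fit)": (0.4459, 0.0094),
    "NNLO DIS": (0.4324, 0.0281),
    "CTEQ (NLO)": (0.4197, 0.0307),
    "MRST (NLO)": (0.4317, 0.0204),
}

ENHANCEMENT = 10.0  # correction / experimental error, idealized from the text


def induced_error(value, error, enhancement=ENHANCEMENT):
    """Error on the correction in units of the experimental error."""
    return enhancement * error / value


summary = {name: induced_error(value, error) for name, (value, error) in RATIOS.items()}
```

Under these assumptions the DIS/DY fit induces an error of about 0.2 in units of the experimental error, compared
with roughly 0.5 to 0.7 for the other PDF sets.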

The production of Z and W bosons at hadron colliders can be used to measure partonic luminosities [27]. In Fig. 9,
the NNLO QCD predictions for these rates calculated using the DIS/DY PDFs and DIS PDFs of Ref. [3] and the
corresponding coefficient functions are compared to recent Tevatron results [29]. The errors in the theoretical predictions
arise from experimental uncertainties in the data used in the PDF fit; additional uncertainties come from varying the
normalization factor A_s in Eq. (4) by 40% and from varying the charm quark mass by 20%. Given the experimental
errors on the Z and W production cross-sections, the theoretical predictions agree with the measured rates. The
theory results obtained with the DIS/DY and DIS PDFs agree within one standard deviation, demonstrating good
stability of the fits with respect to the selection of data.

FIG. 9: The preliminary Run II data for the W and Z production rates measured at the Tevatron. The NNLO theoretical
predictions are obtained with the DIS/DY PDFs and the DIS PDFs of Ref. [3].



IV. CONCLUSIONS 



In this paper we extend the NNLO QCD analysis of proton PDFs performed in Ref. [3] by including fixed target
Drell-Yan data into the fit. The possibility to do so without compromising the precision is due to the computation
of the dilepton rapidity distribution in the DY process through NNLO in QCD. When assembling the data
sample, we pay particular attention to the consistency of the DIS and DY data. We find that the DY data do not
agree with the DIS data for large dilepton rapidities; the disagreement actually becomes worse when the NNLO QCD
corrections to the DY cross sections are taken into account. For this reason, we only include the E-605 data and the 
E-866 data on the ratio of proton and deuteron cross-sections in the combined DIS/DY fit. 

We find that the DY data improve the precision of sea quark PDFs at large values of x, x > 0.1. The overall
quality of the DIS/DY fit is good, with χ^2/NDP = 1.13. The differences between the DIS/DY PDFs obtained in this
paper and the DIS PDFs derived in Ref. [3] do not exceed one standard deviation, demonstrating good consistency
of the data.